Improving Question Answering Sentence Selection by Rank Propagation
Authors
Abstract
Open-domain Question Answering (QA) systems typically rely on an answer selection component to rank candidate answer sentences by how likely they are to contain the answer to a given question. This component plays a crucial role in the QA system, as it usually dictates how downstream processing modules (e.g., answer extraction) retain and present answers to users. Most existing work in this field uses machine learning techniques that capture relations between question and answer sentences to train models for ranking answer candidates. In this paper, we introduce RankProp, a method that can be used as a post-processing step to improve the quality of the ranking results produced by classifiers. Based on the assumptions that similar sentences should be ranked close together and that answer sentences ranked at the top by classifiers are more likely to contain the answer, our algorithm exploits similarities among the answer sentences and propagates this information. It takes ranking scores and similarities as input and uses convex optimization to adjust the ranking further. Similarities can be evaluated through word embedding techniques. In addition, if the expected answer type can be generated by the answer type prediction module, our algorithm also incorporates this information, together with the entity types extracted from the candidate answers, to further improve the quality of the ranking. Experimental results on the Jacana-QA system demonstrate that our method generally improves the performance of the ranking module by 5% on average. Moreover, the RankProp algorithm is generic in the sense that it makes no assumption about how the overall QA system works. Therefore, in principle, it can be inserted into an existing QA system to improve its performance without modifying any other components.
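The abstract does not state the optimization problem itself, so the following is only a minimal sketch of the general idea (classifier scores adjusted by similarity-based propagation), not the authors' formulation. It assumes a standard graph-Laplacian smoothing objective, minimizing ||f - s||^2 + lam * sum over pairs of w_ij (f_i - f_j)^2, which is convex and has a closed-form solution. The function names, the `lam` parameter, and the toy embeddings are all hypothetical.

```python
import numpy as np


def cosine_similarity_matrix(embeddings):
    """Pairwise cosine similarity between sentence vectors (one per row).

    Negative similarities are clipped to zero and the diagonal is zeroed,
    so the result can serve as a non-negative graph weight matrix.
    """
    x = np.asarray(embeddings, dtype=float)
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    unit = x / np.clip(norms, 1e-12, None)
    sim = np.clip(unit @ unit.T, 0.0, None)
    np.fill_diagonal(sim, 0.0)
    return sim


def propagate_scores(scores, similarity, lam=1.0):
    """Smooth classifier scores over the similarity graph.

    Assumed objective (not taken from the paper):
        ||f - s||^2 + lam * sum_{i<j} w_ij * (f_i - f_j)^2
    Setting the gradient to zero gives (I + lam * L) f = s,
    where L = D - W is the graph Laplacian of the weights W.
    """
    s = np.asarray(scores, dtype=float)
    w = np.asarray(similarity, dtype=float)
    laplacian = np.diag(w.sum(axis=1)) - w
    return np.linalg.solve(np.eye(len(s)) + lam * laplacian, s)


# Toy input: sentence 1 is nearly identical to the top-ranked sentence 0,
# while sentence 2 is unrelated. Vectors stand in for averaged word embeddings.
toy_embeddings = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0]]
toy_scores = [0.9, 0.2, 0.5]

toy_similarity = cosine_similarity_matrix(toy_embeddings)
adjusted = propagate_scores(toy_scores, toy_similarity, lam=1.0)
print("adjusted scores:", adjusted)
print("ranking (best first):", np.argsort(-adjusted))
```

In this toy case the low-scored sentence that closely resembles the top candidate is pulled upward, and the two similar sentences end up closer together in score. Larger values of `lam` trust the similarity graph more and the original classifier scores less.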
Similar resources
Optimizing question answering systems by Accelerated Particle Swarm Optimization (APSO)
One of the most important research areas in natural language processing is Question Answering Systems (QASs). Existing search engines, with Google at the top, have many remarkable capabilities. But there is a basic limitation (search engines do not have deduction capability), a capability which a QAS is expected to have. In this perspective, a search engine may be viewed as a semi-mechanized QA...
Question Answering Using Enhanced Lexical Semantic Models
In this paper, we study the answer sentence selection problem for question answering. Unlike previous work, which primarily leverages syntactic analysis through dependency tree matching, we focus on improving the performance using models of lexical semantic resources. Experiments show that our systems can be consistently and significantly improved with rich lexical semantic information, regardl...
FAQ-based Question Answering via Word Alignment
In this paper, we propose a novel word-alignment-based method to solve the FAQ-based question answering task. First, we employ a neural network model to calculate question similarity, where the word alignment between two questions is used for extracting features. Second, we design a bootstrap-based feature extraction method to extract a small set of effective lexical features. Third, we propose a...
RMIT at the NTCIR-12 MobileClick-2: iUnit Ranking and Summarization Subtasks
[1] R.-C. Chen, D. Spina, W.B. Croft, M. Sanderson, and F. Scholer. Harnessing Semantics for Answer Sentence Retrieval. In Proceedings of ESAIR'15, 2015. [5] D. Metzler and T. Kanungo. Machine Learned Sentence Selection Strategies for Query-Biased Summarization. In Proceedings of SIGIR 2008 Learning to Rank Workshop, 2008. [7] L. Yang, Q. Ai, D. Spina, R.-C. Chen, L. Pang, W.B. Croft, J. Guo, and ...
What is the fastest car in the world?
In this paper, we study the answer sentence selection problem for question answering. Unlike previous work, which primarily leverages syntactic analysis through dependency tree matching, we focus on improving the performance using models of lexical semantic resources. Experiments show that our systems can be consistently and significantly improved with rich lexical semantic information, regardl...